A Parallel FP-Growth Mining Algorithm with Load Balancing Constraints for Traffic Crash Data
نویسندگان
چکیده
Traffic safety is an important part of the roadway in sustainable development. Freeway traffic crashes typically cause serious casualties and property losses, being a threat to public safety. Figuring out potential correlation between various risk factors revealing their coupling mechanisms are effective ways explore identity freeway crash causes. However, existing association rule mining algorithms still have some limitations both efficiency accuracy. Based on this consideration, using data obtained from WDOT (Washington Department Transportation), research constructed multi-dimensional multilevel system for analysis. Considering load balancing, FP-Growth (Frequent Pattern- Growth) algorithm was optimized parallelly based Hadoop platform, achieve efficient accurate calculation massive amounts data; then, according results mechanism among precursors, causes were identified revealed. The show that parallel FPgrowth with balancing constraints has better operating speed than conventional FP-growth towards processing big data. This improved makes full use cluster resources more suitable large sets while retaining original advantages algorithm. In addition, rules model improvement interaction proposed can catch occurrence consequences (lower support degree probably) accurately efficiently.
منابع مشابه
A New Load Balancing Approach for Parallel FP-Growth
Due to the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. So the huge size of data and computation volume of new processing applications such as data mining, leads to new high performance parallel processing systems. One of the most important challenges of such application is quickly and correctly finding the relationship ...
متن کاملLPAS: High Efficiency Load Balancing Parallel Data Mining Algorithm
Association rule discovery plays an important role in knowledge discovery and data mining, and efficiency is especially crucial for an algorithm finding frequent itemsets from a large database. Many methods have been proposed to solve this problem. In addition, parallel computing has been a popular trend, such as on cloud platform, grid system or multicore platform. In this paper, a high effici...
متن کاملLoad Balancing Approach Parallel Algorithm for Frequent Pattern Mining
Association rules mining from transaction-oriented databases is an important issue in data mining. Frequent pattern is crucial for association rules generation, time series analysis, classification, etc. There are two categories of algorithms that had been proposed, candidate set generate-and-test approach (Apriori-like) and Pattern growth approach. Many methods had been proposed to solve the a...
متن کاملDynamic load balancing for parallel traffic simulation
In this paper, we describe implementation of the Parallel Traffic Simulator which applies parallel algorithms and computer graphics in the field of traffic simulation. It simulates traffic behaviour at the individual vehicle level. Parallel environment used consists of a set of workstations connected in a network. Networked workstations make up a loosely coupled parallel computer architecture w...
متن کاملOn Choosing a Load-Balancing Algorithm for Parallel Systems with Temporal Constraints
A key point in parallel systems design is the way clients requests are forwarded and distributed among the servers, trying to obtain the maximum throughput from them or, in other words, the load-balancing policy. Although it is a largely studied theme, with well accepted solutions, the inclusion of temporal constraints, also denoted as deadlines in this work, to the requests brings new complexi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computers Communications & Control
سال: 2022
ISSN: ['1841-9844', '1841-9836']
DOI: https://doi.org/10.15837/ijccc.2022.4.4806